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Abstract 

We study the fundamental problem of information spreading (also known as gossip) in dy- 
namic networks. In gossip, or more generally, fc-gossip, there are k pieces of information (or 
tokens) that are initially present in some nodes and the problem is to disseminate the k tokens 
to all nodes. The goal is to accomplish the task in as few rounds of distributed computation as 
possible. The problem is especially challenging in dynamic networks where the network topology 
can change from round to round and can be controlled by an on-line adversary. 

The focus of this paper is on the power of token-forwarding algorithms, which do not manip- 
ulate tokens in any way other than storing and forwarding them. We first consider a worst-case 
adversarial model first studied by Kuhn, Lynch, and Oshman 1331 in which the communication 
links for each round are chosen by an adversary, and nodes do not know who their neighbors for 
the current round are before they broadcast their messages. Our main result is an U,{nk/\ogn) 
. lower bound on the number of rounds needed for any deterministic token-forwarding algorithm 

Q to solve fc-gossip. This resolves an open problem raised in [33], improving their lower bound of 

I— I r2(nlogfc), and matching their upper bound of 0{nk) to within a logarithmic factor. Our lower 

bound also extends to randomized algorithms against an adversary that knows in each round 
^ the outcomes of the random coin tosses in that round. Our result shows that one cannot obtain 

significantly efficient (i.e., subquadratic) token- forwarding algorithms for gossip in the adver- 
00 sarial model of |33j . We next show that token- forwarding algorithms can achieve subquadratic 

time in the offline version of the problem, where the adversary has to commit all the topology 
changes in advance at the beginning of the computation. We present two polynomial-time offline 
token-forwarding algorithms to solve fc-gossip: (1) an 0(min{nfc, n^/k log n}) round algorithm, 
and (2) an (0(n"^), log n) bicriteria approximation algorithm, for any e > 0, which means that 
^ I if L is the number of rounds needed by an optimal algorithm, then our approximation algo- 

. . rithm will complete in 0{n'^L) rounds and the number of tokens transmitted on any edge is 

^ O(logn) in each round. Our results are a step towards understanding the power and limitation 

of token-forwarding algorithms in dynamic networks. 
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Lower Bounds 
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1 Introduction 



In a dynamic network, nodes (processors/end hosts) and communication links can appear and 
disappear at will over time. Emerging networking technologies such as ad hoc wireless, sensor, 
and mobile networks, overlay and peer-to-peer (P2P) networks are inherently dynamic, resource- 
constrained, and unreliable. This necessitates the development of a solid theoretical foundation 
to design efficient, robust, and scalable distributed algorithms and to understand the power and 
limitations of distributed computing on such networks. Such a foundation is critical to realize the 
full potential of these large-scale dynamic communication networks. 

As a step towards understanding the fundamental computation power of dynamic networks, 
we investigate dynamic networks in which the network topology changes arbitrarily from round to 
round. We first consider a worst-case model that was introduced by Kuhn, Lynch, and Oshman |33j 
in which the communication links for each round are chosen by an online adversary, and nodes do 
not know who their neighbors for the current round are before they broadcast their messages. (Note 
that in this model, only edges change and nodes are assumed to be fixed.) The only constraint on the 
adversary is that the network should be connected at each round. Unlike prior models on dynamic 
networks, the model of |33j does not assume that the network eventually stops changing and requires 
that the algorithms work correctly and terminate even in networks that change continually over 
time. 

In this paper, we study the fundamental problem of information spreading (also known as 
gossip). In gossip, or more generally, A;-gossip, there are k pieces of information (or tokens) that are 
initially present in some nodes and the problem is to disseminate the k tokens to all nodes. (By just 
gossip, we mean n-gossip, where n is the network size.) Information spreading is a fundamental 
primitive in networks which can be used to solve other problem such as broadcasting and leader 
election. Indeed, solving n-gossip, where the number of tokens is equal to the number of nodes in 
the network, and each node starts with exactly one token, allows any function of the initial states 
of the nodes to be computed, assuming that the nodes know n [33j . 

1.1 Our results 

The focus of this paper is on the power of token-forwarding algorithms, which do not manipulate 
tokens in any way other than storing and forwarding them. Token-forwarding algorithms are 
simple, often easy to implement, and typically incur low overhead. In a key result, [33] showed 
that under their adversarial model, fc-gossip can be solved by token- forwarding in 0{nk) rounds, 
but that any deterministic online token-forwarding algorithm needs Q{n\ogk) rounds. They also 
proved an Q{nk) lower bound for a special class of token-forwarding algorithms, called knowledge- 
based algorithms. Our main result is a new lower bound that applies to any deterministic online 
token-forwarding algorithm for fc-gossip. 

• We show that every online algorithm for the A;-gossip problem takes n{nk/logn) rounds 
against an adversary that, at the start of each round, knows the randomness used by the 
algorithm in the round. This also implies that any deterministic online token-forwarding 
algorithm takes 0,{nk/ log n) rounds. Our result applies even to centralized token- forwarding 
algorithms that have a global knowledge of the token distribution. 

This result resolves an open problem raised in [33], significantly improving their lower bound, 
and matching their upper bound to within a logarithmic factor. Our lower bound also enables 
a better comparison of token-forwarding with an alternative approach based on network coding 
due to [271 [28], which achieves a 0{nk/ logn) rounds using 0(logn)-bit messages (which is not 
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significantly better than the 0{nk) bound using token- forwarding) , and 0{n + k) rounds with large 
message sizes (e.g., O(nlogn) bits). It thus follows that for large token and message sizes there is 
a factor Q{mm{n, k} / logn) gap between token-forwarding and network coding. We note that in 
our model we allow only one token per edge per round and thus our bounds hold regardless of the 
token size. 

Our lower bound indicates that one cannot obtain efficient (i.e., subquadratic) token- forwarding 
algorithms for gossip in the adversarial model of [33j- Furthermore, for arbitrary token sizes, we 
do not know of any algorithm that is significantly faster than quadratic time. This motivates 
considering other weaker (and perhaps, more realistic) models of dynamic networks. In fact, it is 
not clear whether one can solve the problem significantly faster even in an offline setting, in which 
the network can change arbitrarily each round, but the entire evolution is known to the algorithm 
in advance. Our next contribution takes a step in resolving this basic question for token-forwarding 
algorithms. 

• We present a polynomial-time offline token-forwarding algorithm that solves the /c-gossip 
problem on an n-node dynamic network in 0{m.m{nk,n^/klogn}) rounds with high proba- 
bility. 

• We also present a polynomial-time offline token-forwarding algorithm that solves the A;-gossip 
problem in a number of rounds within an 0{n^) factor of the optimal, for any e > 0, assuming 
the algorithm is allowed to transmit O(logn) tokens per round. 

The above upper bounds show that in the offline setting, token-forwarding algorithms can achieve 
a time bound that is within 0{^/klogn) of the information-theoretic lower bound of 0(n + k), 
and that we can approximate the best token- forwarding algorithm to within a 0{rf) factor, given 
logarithmic extra bandwidth per edge. 

1.2 Related work 

Information spreading (or dissemination) in networks is one of the most basic problems in computing 
and has a rich literature. The problem is generally well-understood on static networks, both for 
interconnection networks [M] as well as general networks j35l [6]. In particular, the fc-gossip problem 
can be solved in 0{n-\-k) rounds on any n-static network [38]. There also have been several papers 
on broadcasting, multicasting, and related problems in static heterogeneous and wireless networks 
(e.g., see 13 [El O [22| ) . 

Dynamic networks have been studied extensively over the past three decades. Some of the early 
studies focused on dynamics that arise out of faults, i.e., when edges or nodes fail. A number of 
fault models, varying according to extent and nature (e.g., probabilistic vs. worst-case) and the 
resulting dynamic networks have been analyzed (e.g., see Ol35j). There have been several studies 
on models that constrain the rate at which changes occur, or assume that the network eventually 
stabilizes (e.g., see [D [251 [26] ) . 

There also has been considerable work on general dynamic networks. Some of the earliest studies 
in this area include [21 [13] which introduce general building blocks for communication protocols on 
dynamic networks. Another notable work is the local balancing approach of for solving routing 
and multicommodity flow problems on dynamic networks. Algorithms based on the local balancing 
approach continually balance the packet queues across each edge of the network and drain packets 
that have reached their destination. The local balancing approach has been applied to achieve 
near-optimal throughput for multicast, anycast, and broadcast problems on dynamic networks as 
well as for mobile ad hoc networks [TUl [121 [2H| • 
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Modeling general dynamic networks has gained renewed attention with the recent advent of 
heterogeneous networks composed out of ad hoc, and mobile devices. To address the unpredictable 
and often unknown nature of network dynamics, |33j introduce a model in which the communication 
graph can change completely from one round to another, with the only constraint being that the 
network is connected at each round. The model of [33] allows for a much stronger adversary than 
the ones considered in past work on general dynamic networks [TUl \T7\ . In addition to results on 
the /c-gossip problem that we have discussed earlier, [33j consider the related problem of counting, 
and generalize their results to the T-interval connectivity model, which includes an additional 
constraint that any interval of T rounds has a stable connected spanning subgraph. The survey 
of [32] summarizes recent work on dynamic networks. 

We note that the model of ^33j, as well as ours, allow only edge changes from round to round 
while the nodes remain fixed. Recently, the work of [?J introduced a dynamic network model 
(motivated by P2P networks) where both nodes and edges can change by a large amount (up to 
a linear fraction of the network size). They show that stable almost-everywhere agreement can be 
efficiently solved in such networks even in adversarial dynamic settings. 

Recent work of \27\ [28] presents information spreading algorithms based on network coding [3] . 
As mentioned earlier, one of their important results is that the A:-gossip problem on the adversarial 
model of [33 can be solved using network coding in 0(n + k) rounds assuming the token sizes are 
sufficiently large (r2(n log n) bits). For further references to using network coding for gossip and 
related problems, we refer to the recent works of [271 [2E1 El HZl [Ml [36] and the references therein. 

Our offline approximation algorithm makes use of results on the Steiner tree packing problem 
for directed graphs [2T]. This problem is closely related to the directed Steiner tree problem (a 
major open problem in approximation algorithms) |191 [39] and the gap between network coding 
and flow-based solutions for multicast in arbitrary directed networks [3, S7\. 

Finally, we note that there are also a number of studies that solve /c-gossip and related problems 
using gossip-based processes. In a local gossip-based algorithm, each node exchanges information 
with a small number of randomly chosen neighbors in each round. Gossip-based processes have 
recently received significant attention because of their simplicity of implementation, scalability to 
large network size, and their use in aggregate computations, e.g., [161 Ell [SH [20l [301 133 [E] and 
the references therein. All these studies assume an underlying static communication network, and 
do not apply directly to the models considered in this paper. A related recent work on dynamic 
networks is ^ which analyzes the cover time of random walks on dynamic networks. 

2 Model and problem statement 

In this section, we formally define the A;-gossip problem, the online and offline models, and token- 
forwarding algorithms. 

The k-gossip problem. In this problem, k different tokens are assigned to a set V n > k nodes, 
where each node may have any subset of the tokens, and the goal is to disseminate all the k tokens 
to all the nodes. 

The onhne model. Our online model is the worst-case adversarial model of |33j . Nodes communicate 
with each other using anonymous broadcast. We assume a synchronized communication. At the 
beginning of round r, each node in V decides what message to broadcast based on its internal state 
and coin tosses (for a randomized algorithm); the adversary chooses the set of edges that forms 
the communication network Gr over V for round r. We adopt a strong adversary model in which 
adversary knows the outcomes of the random coin tosses used by the algorithm in round r at the 
time of constructing Gr but is unaware at this time of the outcomes of any randomness used by 



3 



the algorithm m future rounds. The only constraint on Gr is that it be connected; this is the same 
as the 1-interval connectivity model of [33] • 

As observed in [33], the above model is equivalent to the adversary knowing the messages to 
be sent in round r before choosing the edges for round r. We do not place any bound on the size 
of the messages, but require for our lower bound that each message contains at most one token. 
Finally, we note that under the strong adversary model, there is a distinction between randomized 
algorithms and deterministic algorithms since a randomized algorithm may be able to exploit the 
fact that in any round r, while the adversary is aware of the randomness used in that round, it 
does not know the outcomes of any randomness used in subsequent rounds. 

The offline model. In the offline model, we are given a sequence of networks {Gr) where Gr is a 
connected communication network for round r. As in the online model, we assume that in each 
round at most one token is broadcast by any node. It can be easily seen that the /c-gossip problem 
can be solved in nk rounds in the offline model; so we may assume that the given sequence of 
networks is of length at most nk. 

Token-forwarding algorithms. Informally, a token-forwarding algorithm is one that does not com- 
bine or alter tokens, only stores and forwards them. Formally, we call an algorithm for A:-gossip a 
token- forwarding algorithm if for every node v, token t, and round r, v contains t at the start of 
round r of the algorithm if and only if either v has t at the start of the algorithm or v received a 
message containing t prior to round r. 

Finally, several of our arguments are probabilistic. We use the term "with high probability" 
to mean with probability at least 1 — l/n'^, for a constant c that can be made sufficiently high by 
adjusting related constant parameters. 

3 Lower bound for online token- forwarding algorithms 

In this section, we give an Q{kn/ logn) lower bound on the number of rounds needed by any online 
token-forwarding algorithm for the A;-gossip problem against a strong adversary. As discussed 
earlier, this immediately implies the same lower bound for any deterministic online token-forwarding 
algorithm. Our lower bound applies to even centralized algorithms and a large class of initial token 
distributions. We first describe the adversary strategy. 

Adversary: The strategy of the adversary is simple. We use the notion of free edge introduced 
in [33]. In a given round r, we call an edge {u,v) to be a free edge if at the start of round r, u has 
the token that v broadcasts in the round and v has the token that u broadcasts in the rouncQ an 
edge that is not free is called non-free. Thus, if (n, v) is a free edge in a particular round, neither 
u nor V can gain any new token through this edge in the round. Since we are considering a strong 
adversary model, at the start of each round, the adversary knows for each node v, the token (if 
any) that v will broadcast in that round. In round r, the adversary constructs the communication 
graph Gr as follows. First, the adversary adds all the free edges to Gr- Let Ci, C2, . . . , C; denote 
the connected components thus formed. The adversary then guarantees the connectivity of the 
graph by selecting an arbitrary node in each connected component and connecting them in a line. 
Figure [T] illustrates the construction. 

The network Gr thus constructed has exactly / — 1 non-free edges, where / is the number of 
connected components formed by the free edges of Gr. If {u,v) is a non-free edge in Gr, then n, 
V, or both will gain at most new token through this edge. We refer to such a token exchange on a 
non-free edge as a useful token exchange. 

^For convenience, when a node does not broadcast any token we will view it as broadcasting a special empty token 
that every node has. This allows us to avoid treating the empty broadcast as a special case. 
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We bound the running-time of any token-forwarding algorithm by identifying a critical structure 
that quantifies the progress made in each round. We say that a sequence of nodes vi,V2, ■ ■ ■ ,Vk is 
half-empty in round r with respect to a sequence of tokens ti,t2, ■ ■ ■ ,tk if the following condition 
holds at the start of round r: for all 1 < i, j < k, i ^ j, either Vi is missing tj or vj is missing U. We 
then say that (vi) is half-empty with respect to (tj) and refer to the pair {{vi), (tj)) as a half-empty 
configuration of size k. 



non-free edge 




LJ 



nor-free edge 




non-free edge 



non-free edge 




V V3 




Vl 



Figure 1: The network constructed by the adversary in a particular round. Note that if node Vi 
broadcasts token ti, then the {vi) forms a half-empty configuration with respect to (tj) at the start 
of this round. 



Lemma 1. If m useful token exchanges occur in round r, then there exists a half-empty configura- 
tion of size at least m/2 -\-\ at the start of round r. 

Proof. Consider the network Gr in round r. Each non-free edge can contribute at most 2 useful 
token exchanges. Thus, there are at least m/2 non-free edges in the communication graph. Based on 
the adversary we consider, no useful token exchange takes place within the connected components 
induced by the free edges. Useful token exchanges can only happen over the non-free edges between 
connected components. This implies there are at least m/2 -\- 1 connected components in the 
subgraph of Gr induced by the free edges. Let Vi denote an arbitrary node in the ith connected 
component in this subgraph, and let tj be the token broadcast by Vi in round r. For i ^ j, since Vi 
and Vj are in different connected components, {vi,Vj) is a non-free edge in round r; hence, at the 
start of round r, either Vi is missing tj or Vj is missing ti. Thus, the sequence {vi) of nodes of size 
at least m/2 -\- 1 is half-empty with respect to the sequence {ti) at the start of round r. □ 

An important point to note about the definition of a half-empty configuration is that it only 
depends on the token distribution; it is independent of the broadcast in any round. This allows us 
to prove the following easy lemma. 

Lemma 2. If a sequence {vi) of nodes is half-empty with respect to {ti) at the start of round r, 
then {vi) is half-empty with respect to {ti) at the start of round r' for any r' < r. 

Proof. The lemma follows immediately from the fact that if a node Vi is missing a token tj at the 
start of round r, then Vi is missing token tj at the start of every round r' < r. □ 

Lemmas [T] and [2] suggest that if we can identify a token distribution in which all half-empty 
configuration are small, we can guarantee small progress in each round. We now show that there 
are many token distributions with this property, thus yielding the desired lower bound. 
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Theorem 3. From an initial token distribution in which each node has each token independently 
with probability 3/4, any online token- forwarding algorithm will need Q{kn/ logn) rounds to com- 
plete with high probability against a strong adversary. 

Proof. We first note tliat if the number of tokens k is less than 100 log n, then the ^}{kn/ logn) 
lower bound is trivially true because even to disseminate one token it will take r2(n) rounds in the 
worst-case. Thus, in the following proof, we focus on the case where k > 100 log n. 

Let El denote the event that there exists a half-empty configuration of size / at the start of the 
first round. For Ei to hold, we need I nodes vi,V2, ■ ■ ■ ,vi and I tokens ti,t2, ■ ■ ■ ,ti such that for all 
i j either Vi is missing tj or vj is missing tj. For a pair of nodes u and v, by union bound, the 
probability that u is missing tv or v is missing tu is at most 1/4-1-1/4 = 1/2. Thus, the probability 
of El can be bounded as follows. 

„ fn\ k\ , 1 22'i°g" 



IJ {k-l)\ \2J - 2^{i-i)/2 - 2Ki-i)/2- 

In the above inequality, (") is the number of ways of choosing the / nodes that form the half-empty 

configuration, kl/{k — 1)1 is the number of ways of assigning I distinct tokens, and (l/2)(2) is the 
upper bound on the probability for each pair i ^ j that either vi is missing tj or Vj is missing 
tj. For / = 51ogn, Pr [i?/] < l/n^. Thus, the largest half-empty configuration at the start of the 
first round, and hence at the start of any round, is of size at most 51ogn with probability at least 
1 — 1/n^. By Lemma [l| we thus obtain that the number of useful token exchanges in each round 
is at most 10 log n, with probability at least 1 — 1/n^. 

Let Mi be the number of tokens that node i is missing in the initial distribution. Then Mi is 
a binomial random variable with E [Mi] = k/A. By a straightforward Chernoff bound, we have the 
probability that node i misses less than fc/8 tokens is 



Pr 



Mi < 



Pr 



Mi < ( 1 - ^ y E [M, 



< e 2 = e 32 . 



Therefore, the total number of tokens missing in the initial distribution is at least n ■ k/8 = Q(kn) 

k n 

with probability at least 1 — n/eaz > 1 — 1/n (A; > 100 log n). Since the number of useful tokens 
exchanged in each round is at most 10 log n, the number of rounds needed to complete A:-gossip is 
Q{kn/ logn) with high probability. □ 

Theorem[3]does not apply to certain natural initial distributions, such as one in which each token 
resides at exactly one node. While this class of token distributions has far fewer tokens distributed 
initially, the argument of Theorem |3] does not rule out the possibility that an algorithm, when 
starting from a distribution in this class, avoids the problematic configurations that arise in the 
proof. In the following. Theorem [5] extends the lower bound to this class of distributions. 

Lemma 4. From any distribution in which each token starts at exactly one node and no node has 
more than one token, any online token-forwarding algorithm for k-gossip needs n{kn/ logn) rounds 
against a strong adversary. 

Proof. We consider an initial distribution C where each token is at exactly one node, and no node 
has more than one token. Let C* be an initial token distribution from which any online algorithm 
needs ^}{kn/ logn) rounds. The existence of C* follows from Theorem [sj We construct a bipartite 
graph on two copies of V, Vi and V2. A node v £ Vi is connected to a node u £ V2 in C* u has 
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all the tokens that v has in C. We will show below that this bipartite graph has a perfect matching 
with positive probability. 

Given a perfect matching M, we can complete the proof as follows. For u G V2, let M{v) denote 
the node in Vi that got matched to v. If there is an algorithm A that runs in T rounds from 
starting state C, then we can construct an algorithm A* that runs in the same number of rounds 
from starting state C* as follows. First every node v deletes all its tokens except for those which 
M{v) has in C. Then algorithm A* runs exactly as A. Thus, the lower bound of Theorem [3| which 
applies to A* ^ also applies to A. 

It remains to prove that the above bipartite graph has a perfect matching. This follows from an 
application of Hall's Theorem. Consider a set of m nodes in V2. We want to show their neighborhood 
in the bipartite graph is of size at least m. We show this condition holds by the following 2 cases. 
If m < 3n/5, let Xi denote the neighborhood size of node i. We know E [Xi] > 3n/4. Then by 
Chernoff bound 

, -, (1/5)2e[X^] 

Pr [Xi < m] < Pr [Xi < 3n/5] < e 2 = e 200 

By union bound with probability at least 1 — n ■ e-3"/200 ^j^g neighborhood size of every node is at 
least m. Therefore, the condition holds in the first case. If m > 3n/5, we argue the neighborhood 
size of any set of m nodes is Vi with high probability. Consider a set of m nodes, the probability 
that a given token t is missing in all these m nodes is (1/4)™. Thus the probability that any token 
is missing in all these nodes is at most n(l/4)™ < n(l/4)^"'/^. There are at most 2" such sets. By 
union bound, with probability at least 1 — 2" • n(l/4)^"/^ = 1 — n/2"/^, the condition holds in the 
second case. □ 

Theorem 5. From any distribution in which each token starts at exactly one node, any online 
token-forwarding algorithm for k-gossip needs Q{kn/ logn) rounds against a strong adversary. 

Proof. In this theorem, we extend our proof in Lemma [4] to the inital distibution C where each 
token starts at exactly one node, but nodes may have multiple tokens. We prove this theorem by 
the following two cases. 

First case, when at least n/2 nodes start with some token. This implies that k > n/2. Focus 
on the n/2 nodes with tokens. Each of them has at least one unique token. By the same argument 
used in Lemma |4[ disseminating these n/2 distinct tokens to n nodes takes r2(n^/logn) rounds. 
Thus, in this case the number of rounds needed is f](/cn/ log n). 

Second case, when less than n/2 nodes start with some token. In this case, the adversary can 
group these nodes together, and treat them as one super node. There is only one edge connecting 
this super node to the rest of the nodes. Thus, the number of useful token exchange provided by 
this super node is at most one in each round. If there exsits an algorithm that can disseminate k 
tokens in o(A;n/ log n) rounds, then the contribution by the super node is o(/cn/ log n). And by the 
same argument used in Lemma |4] we know dissemination k tokens to n/2 nodes (those start with 
no tokens) takes Q{kn/ logn) rounds. Thus, the theorem also holds in this case. □ 



4 Subquadratic time offline token-forwarding algorithms 



In this section, we give two centralized algorithms for the fc-gossip problem in the offline model. We 
present an 0(min{n^'^\/Iogn, nA:}) round algorithm in Section 4.1 Then we present a bicriteria 



(©(n*^), logn)-approximation algorithm in Section 4.2, which means if L is the number of rounds 



needed by an optimal algorithm where one token is broadcast by every node per round, then our 
approximation algorithm will complete in O(n^L) rounds and the number of tokens broadcast by 
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any node is O(logn) in any given round. Both of these algorithms uses a directed capacitated 
leveled graph constructed from the sequence of communication graphs which we call the evolution 
graph. 

Evolution graph: Let V be the set of nodes. Consider a dynamic network of / rounds numbered 1 
through I and let Gi be the communication graph for round i. The evolution graph for this network 
is a directed capacitated graph G with 21 + 1 levels constructed as follows. We create 21 + 1 copies 
of V and call them Vq, V2, . . . , V2i. Vi is the set of nodes at level i and for each node u in y, we call 
its copy in Vi as Vi. For z = 1, . . . , /, level 2i — 1 corresponds to the beginning of round i and level 
2i corresponds to the end of round i. Level corresponds to the network at the start. Note that 
the end of a particular round and the start of the next round are represented by different levels. 
There are three kinds of edges in the graph. First, for every round i and every edge (u, v) G Gi, we 
place two directed edges with unit capacity each, one from U2i-i to V2i and another from V2i-i to 
U2i- We call these edges broadcast edges as they will correspond to broadcasting of tokens; the unit 
capacity on each such edge will ensure that only one token can be sent from a node to a neighbor 
in one round. Second, for every node v in V and every round i, we place an edge with infinite 
capacity from ^2(1-1) to ^2i- We call these edges buffer edges as they ensure tokens can be stored 
at a node from the end of one round to the end of the next. Finally, for every node v € V and 
every round i, we also place an edge with unit capacity from f2(j_i) to V2i~i- We call these edges as 
selection edges as they correspond to every node selecting a token out of those it has to broadcast 
in round i; the unit capacity ensures that in a given round a node must send the same token to all 
its neighbors. Figure [2] illustrates our construction, and Lemma [6] explains its usefulness. 

^ broadcast edge (unit capacity) 

, . selection edge (unit capacity) 

buffer edge (infinite capacity) 



(B) (D) 

\y (c) — ^ 




Figure 2: An example of how to construct the evolution graph from a sequence of communication 
graphs. 



Lemma 6. Let there be k tokens, each with a source node where it is present in the beginning and 
a set of destination nodes to whom we want to send it. It is feasible to send all the tokens to all 
of their destination nodes in a dynamic network using I rounds, where in each round a node can 
broadcast only one token to all its neighbors, if and only if k directed Steiner trees can be packed 
in the corresponding evolution graph with 21 + 1 levels respecting the edge capacities, one for each 
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token with its root being the copy of the source node at level and its terminals being the copies of 
the destination nodes at level 21. 

Proof. Assume that k tokens can be sent to all of their destinations in / rounds and fix one broadcast 
schedule that achieves this. We will construct k directed Steiner trees as required by the lemma 
based on how the tokens reach their destinations and then argue that they all can be packed in 
the evolution graph respecting the edge capacities. For a token i, we construct a Steiner tree T* 
as follows. For each level j G {0, . . . , 21}, we define a set Sj of nodes at level j inductively starting 
from level 21 backwards. 5*2; is simply the copies of the destination nodes for token i at level 21. 
Once S2(^j_^-^^ is defined, we define S^j (respectively 5*2^+1) as: for each V2(j+i) ^ 5'2q_,_]^^, include V2j 
(respectively nothing) if token i has reached node v after round j, or include a node U2j (respectively 
U2j+i) such that u has token i at the end of round j which it broadcasts in round j + 1 and (n, v) 
is an edge of Gj+i. Such a node u can always be found because whenever V2j is included in S^j, 
node V has token i by the end of round j which can be proved by backward induction staring from 
j = I. It is easy to see that Sq simply consists of the copy of the source node of token i at level 
0. T* is constructed on the nodes in U^~q'5*. If for a vertex v, f2(j+i) ^ '^'2(^+1) ^^'^ G S2j, we 
add the buffer edge (^2^,^2(^+1)) in T*. Otherwise, if 1^20+1) ^ '^2(j+i) "^"^J ^ ^2j^ 
selection edge {u2j,U2j+i) and broadcast edge (1^2^+1 ; ^^2(j+i)) where u was the node chosen 

as described above. It is straightforward to see that these edges form a directed Steiner tree for 
token i as required by the lemma which can be packed in the evolution graph. The argument is 
completed by noting that any unit capacity edge cannot be included in two different Steiner trees as 
we started with a broadcast schedule where each node broadcasts a single token to all its neighbors 
in one round, and thus all the k Steiner trees can be simultaneously packed in the evolution graph 
respecting the edge capacities. 

Next assume that k Steiner trees as in the lemma can be packed in the evolution graph respecting 
the edge capacities. We construct a broadcast schedule for each token from its Steiner tree in the 
natural way: whenever the Steiner tree Tj corresponding to token i uses a broadcast edge (u2j-i, V2j) 
for some j, we let the node u broadcast token i in round j. We need to show that this is a feasible 
broadcast schedule. First we observe that two different Steiner trees cannot use two broadcast 
edges starting from the same node because every selection edge has unit capacity, thus there are 
no conflicts in the schedule and each node is asked to broadcast at most one token in each round. 
Next we claim by induction that if node V2j is in T*, then node v has token i by the end of round j. 
For j = 0, it is trivial since only the copy of the source node for token i can be included in T* from 
level 0. For j > 0, if V2j is in T*, we must reach there by following the buffer edge {v2(j-i),V2j) or 
a broadcast edge {u2j-i, V2j)- In the former case, by induction node v has token i after round j — 1 
itself. In the latter case, node u which had token i after round j — 1 by induction was the neighbor 
of node v in Gj and u broadcast token i in round j, thus implying node v has token i after round j. 
From the above claim, we conclude that whenever a node is asked to broadcast a token in round j, 
it has the token by the end of round j — 1. Thus the schedule we constructed is a feasible broadcast 
schedule. Since the copies of all the destination nodes of a token at level 21 are the terminals of its 
Steiner tree, we conclude all the tokens reach all of their destination nodes after round I. □ 

4.1 An 0{min{n^/k\ogn,nk}) round algorithm 

Our algorithm is given in Algorithm [T] and analyzed in Lemma [7] and |8j 

Lemma 7. Let there be k < n tokens at given source nodes and let v be an arbitrary node. Then, 
all the tokens can be sent to v using broadcasts in 0{n) rounds. 
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Figure 3: An example of building directed Steiner tree in the evolution graph G based on token 
dissemination process. Token t starts from node B. Thus, the Steiner tree is rooted at Bq in G. 
Since Bq has token t, we include the infinite capacity buffer edge {Bq, B2). In the first round, node 
B broadcasts token t, and hence we include the selection edge {Bq,Bi). Nodes A and G receive 
token t from B in the first round, so we include edges ^2)5 (-Bi,C2). Now A2, B2, and G2 all 
have token t. Therefore we include the edges {A2, A4), {B2, -B4), and (C2, C4). In the second round, 
all of A, B, and G broadcast token t, we include edges (^2,^3)) (-62,-63), (C2,C3). Nodes D and 
E receive token t from G. So we include edges (C3, D4) and (C3, £'4). Notice that nodes A and B 
also receive token t from C, but they already have token t. Thus, we don't include edges {G^jB^) 
or (C3,^4). 



Proof. By lemma [6| we will be done in n + k rounds if we can show that k paths, one from 
every source vertex at level to t'2(n+fc)) can be packed in the corresponding evolution graph with 
2(n + A:) + 1 levels respecting the edge capacities. For this, we consider the evolution graph and 
add to it a special vertex v-i at level —1 and connect it to every source at level by an edge of 
capacity 1. (Multiple edges get fused with corresponding increase in capacity if multiple tokens 
have the same source.) We claim that the value of the min-cut between v-i and f2(n+A:) is at least 
k. Before proving this, we complete the proof of the claim assuming this. By the max fiow min cut 
theorem, the max flow between v-i and V2(^n+k) is at least k. Since we connected f_i with each of 
the k token sources at level by a unit capacity edge, it follows that unit flow can be routed from 
each of these sources at level to f2(n+fc) respecting the edge capacities. It is easy to see that this 
implies we can pack k paths, one from every source vertex at level to V2(n+k)^ respecting the edge 
capacities. 

To prove our claimed bound on the min cut, consider any cut of the evolution graph separating 
V-i from f2(n+fc) and let S be the set of the cut containing v-i. If S includes no vertex from level 
0, we are immediately done. Otherwise, observe that if V2j G S for some < j < {n + k) and 
W2(j+i) ^ then the value of the cut is inflnite as it cuts the buffer edge of infinite capacity out 
of V2j- Thus we may assume that if V2j £ S, then ^20+1) ^ S. Also observe that since each of 
the communication graphs Gi, . . . , Gn+k are connected, if the number of vertices in S from level 
2(j + 1) is no more than the number of vertices from level 2j and not all vertices from level 2(j + 1) 
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are in S, we get at least a contribution of 1 in the value of the cut. But since the total number of 
nodes is n and V2(n+k) ^ there must be at least k such levels, which proves the claim. □ 



Algorithm 1 0{uim{ny/k log n, nk}) round algorithm in the offline model 
Require: A sequence of communication graphs Gi, i = 1,2, .. . 
Ensure: Schedule to disseminate k tokens. 

1: if A; < i/logn then 

2: for each token t do 

3: For the next n rounds, let every node who has token t broadcast the token. 

4: end for 

5: else 

6: Choose a set S of 2^/kJogn random nodes. 

7: for each vertex in v G S do 

8: Send each of the k tokens to vertex v in 0{n) rounds. 

9: end for 

10: for each token t do 

11: For the next 2n^y (log n)/k rounds, let every node who has token t broadcast the token. 

12: end for 

13: end if 



Theorem 8. Algorithm^ solves the k-gossip problem using 0(min{n-^A; logn, nk}) rounds with 
high probability in the offline model. 

Proof. It is trivial to see that if < ylogn, then the algorithm will end in nk rounds and each 
node receives all the k tokens. Assume k > ^/^ogn. By Lemma [7| all the tokens can be sent to 
all the nodes in 5 using 0{ny/klogn) rounds. Now fix a node v and a token t. Since token t is 
broadcast for 2ny^ [log n)/k rounds, there is a set 5* of at least 2n^y {logn)/k nodes from which v 
is reachable within those rounds. It is clear that if S intersects S**, v will receive token t. Since the 
set S was picked uniformly at random, the probability that S does not intersect 5"* is at most 

in—2nyj (log n) / fc\ 
V 2v'fclogn 

(2v'fclog n) 

Thus every node receives every token with probability 1 — Xjr? . It is also clear that the algorithm 
finishes in Oin^Jk log n) rounds. □ 

Algorithm [l] can be derandomized using the standard technique of conditional expectations, 
shown in Algorithm [2j Given a sequence of communication graphs, if node u broadcasts token 
t for A rounds and every node that receives token t also broadcasts t during that period, then 
we say node v is within A broadcast distance to u if and only if v receives token t by the end 
of round A. Let 5 be a set of nodes, and |5| < 2y/klogn. We use Pr [u; S';T] to denote the 
probability that the broadcast distance from node u to set X is greater than 2n-\/ {logn)/k, where 
X = S U {pick 2y/k log n — \ S\ nodes uniformly at random from V\T^, and ^(5", T) denotes the 
sum, over all u in V, of Pr [u; S;T]. 

Lemma 9. The set S returned by Algorithm ^ contains at most 2yJk\ogn nodes, and the broadcast 
distance from every node to S is at most 2ny^{logn)/k. 



n — 2n^ (log n)//c 



k: log 11 



< 



n 



n 



4 ■ 
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Algorithm 2 Derandomized algorithm for Step [6] in Algorithm [T] 
Require: A sequence of communication graphs Gi, i = 1,2, .. ., and k > yTogn 
Ensure: A set of 2^/k\ogn nodes S such that the broadcast distance from every node n to 5 is 
within 2n^yJlogny/k. 

1: Set S and T be 0. 

2: for each v £ V do 
3: T = TLl{v} 

4: if P{S U{v}, T) < P{S,T) then 
5: 5 = 5UH 
6: end if 

7: end for 

8: return S 



Proof. Let us view the process of randomly selecting 2-^k log n nodes as a computation tree. This 
tree is a complete binary tree of height n. There are n + 1 nodes on any root-leaf path. The level 
of a node is its distance from the root. The computation starts from the root. Each node at the 
ith level is labeled by 6j G {0, 1}, where means not including node i in the final set and 1 means 
including node i in the set. Thus, each root-leaf path, 6162 ■ ■ - bn, corresponds to a selection of 
nodes. For a node a in the tree, let Sa (resp., Ta) denote the sets of nodes that are included (resp., 
lie) in the path from root to o. 

By Theorem [sj we know that for the root node r, we have P(0, 5^) = P(0,0) < If 
c and d are the children of a, then Tc = T^, and there exists a real < p < 1 such that for 
each u in V, Pr [u; Sa;Ta] equals pFv [u; Sc;Tc] + {1 — p) Pr [u; Sd;Td]. Therefore, P{Sa, Ta) equals 
pPiSc,T,) + {l-p)P{Sd,Ta). We thus obtain that min{P(Se, T,), Trf)} <P(5a,T,). Since we 

set S to be A in {Sc, Sd} that minimizes P{X, Tc), we maintain the invariant that P{S, T) < 
In particular, when the algorithm reaches a leaf /, we know P{Si, V) < 1 jv? . But a leaf I corresponds 
to a complete node selection, so that Pr \u; Si-y\ is or 1 for all u, and hence P[Si, V) is an integer. 
We thus have P{Si, V) = 0, implying that the broadcast distance from node u to set Si is at most 
2n\J {\o^n)lk for every /. Furthermore, is 2k^J\o^n by construction. 

Finally, note that Step [4] of Algorithm [2] can be implemented in polynomial time, since for each 
u in V , Pr [u; S;T\ is simply the ratio of two binomial coefficients with a polynomial number of bits. 
Thus, Algorithm [2] is a polynomial time algorithm with the desired property. □ 

4.2 An (0(ra''), log n)-approximation algorithm 

Here we introduce an (0(n'^), log n)-approximation algorithm for the fc-gossip problem in the offline 
model. This means, if the /c-gossip problem can be solved on any n-node dynamic network in L 
rounds, then our algorithm will solve the /c-gossip problem on any dynamic network in 0{rfL) 
rounds, assuming each node is allowed to broadcast O(logn) tokens, instead of one, in each round. 
Our algorithm is an LP based one, which makes use of the evolution graph defined earlier. The 
following is a straightforward corollary of Lemma [6j 

Corollary 10. The k-gossip problem can be solved in I rounds if k directed Steiner trees can be 
packed in the corresponding evolution graph, where for each token, the root of its Steiner tree is a 
source node at level 0, and the terminals are all the nodes at level 21. 

Packing Steiner trees in general directed graphs is NP-hard to approximate even within Q.{rn}/'^~'') 
for any e > ^21], where m is the number of edges in the graph. Thus, our algorithm focuses on 
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solving Steiner tree packing problem with relaxation on edge capacities, allowing the capacity to 
blow up by a factor of O(logn). First, we write down the LP for the Steiner tree packing problem 
(maximizing the number of Steiner trees packed with respect to edge capacities) . Let T be the set 
of all possible Steiner trees, and Ce be the capacity of edge e. For each Steiner tree T G T, we 
associate a variable xt with it. If xt = 1, then Steiner tree T is in the optimal solution; if xt = 0, 
it's not. After relaxing the integral constraints on x^'s, we have the following LP, referred to as V 
henceforth. Let F{V) denote the optimal fractional solution for V. 

max StsT 

S.t. Ylr-e^T^T 1^ Ce ^/c £ E 

XT>o VT G r 

Lemma 11 ([2T|). There is an 0{n'^)- approximation algorithm for the fractional maximum Steiner 
tree packing problem in directed graphs. 

Let L be the number of rounds that an optimal algorithm uses with every node broadcasting 
at most one token per round. We give an algorithm that takes 0{n^L) rounds with every node 
broadcasting O(logn) tokens per round. Thus ours is an (0{n^), O(logn)) bicriteria approximation 
algorithm, shown in Algorithm [3| 

Algorithm 3 (©(n*^), 0(logn))-approximation algorithm 
Require: A sequence of communication graphs Gi, G2, . . . 
Ensure: Schedule to disseminate k tokens. 

1: Initialize the set of Steiner trees 5 = 0. 
2: for i = 1 ^ 2n^ do 

3: Find L* such that with the evolution graph G constructed from level to level 2L* , the 
approximate value for F(V) is k/n^. In this step, we use the algorithm of |21j to approximate 
F{V). 

4: Let be the value of the variable xt in the solution from step |3j The number of non- 
zero x^'s is polynomial with respect to k. Using randomized rounding, with probability x^ 
include T in the solution, S = S U {T}. Otherwise, don't include T. 

5: Remove communication graphs Gi, G2, ■ ■ ■ , Gl* from the sequence, and reduce the remaining 
graphs' indices by L*. 

6: end for 

7: Use Corollary |10| to convert the set of Steiner trees S into a token dissemination schedule. 



Theorem 12. Algorithm^ achieves an 0{n'') approximation to the k-gossip problem while broad- 
casting O(logn) tokens per round per node, with high probability. 

Proof. We show the following three claims: (i) In Step[7j |5| > k with probability at least 1 — 1/e^^^. 
This is the correctness of Algorithm |3j saying it can find the schedule to disseminate all k tokens, 
(ii) The number of rounds in the schedule produced by Algorithm [3] is at most 0{n'') times the 
optimal one. (iii) In the token dissemination schedule, the number of tokens sent over an edge is 
O(logn) in any round with high probability. 



First, we prove claim (i). Let Xi denote the sum of non-zero x^'s in iteration i. X = Y2'i=i 



We know E [Xi] = k/n". Thus, E [X] = 2n^k/n' 
trees in set S. By Chernoff bound, we have 



2k, which is the expected number of Steiner 



Pr [X <k]= Pr 



X < [l 



E[X] 



< e 



(l/2)^E[X] 
2 



(l/2)^-2fc 



pA:/4 
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Thus, \S\ > k with probabihty at least 1 — Xje^l^ in Stepj?} 

Next we prove claim (ii). Let L denote the number of rounds needed by an optimal algorithm. 
Since in Stepjsjwe used the 0(?i'^)-approximation algorithm in ^21j to solve F(V), we know L* < L. 
There are 2n^ iterations. Thus, the number of rounds needed by Algorithm [3] is at most 2n^L* < 
2rfL, which is an 0(n'^)-approximation on the number of rounds. 

Lastly we prove claim (iii) . When Algorithm |3] does randomized rounding in Step |4| some 
constraint ^T-eeT-^T ^ Cg in 7^ may be violated. In the evolution graph, Cg = 1. Let Y denote the 
sum of x^'s in this constraint. We have E \Y] < Cg = 1. By Chernoff bound. 



Pr [y > E[y] +logn] = Pr 



y > 1 + 



log n 

E[y] 



E[y] 



< 



-EM[(i+|¥f)iKi+^K)-|¥f] 



< 



1 

^log log n 



Thus, the number of tokens sent over a given edge is 0(log n) with probability at least 1 — 1 /n^°^'°s"'. 
Since there are only polynomial number of edges, no edge will carry more than O(logn) tokens in 
a single round with high probability. □ 



5 Conclusion and open questions 

In this paper, we studied the power of token-forwarding algorithms for gossip in dynamic networks. 
We showed a lower bound of Q{nk/\ogn) rounds for any online token forwarding algorithm against 
a strong adversary; our bound matches the known upper bound of 0{nk) up to a logarithmic 
factor. This leaves us with an important open question: what is the complexity of randomized 
online token-forwarding algorithms against a weak adversary that is unaware of the randomness 
used by the algorithm in each round? We note that our lower bound also extends to randomized 
algorithms if the adversary is allowed to be adaptive; that is, the adversary is allowed to make 
its decision in each step with knowledge of the random coin tosses made by the algorithm in that 
step (but without knowledge of the randomness used in future steps). Furthermore, for small 
token sizes (e.g., O(logn) bits) even the best (randomized) online algorithm we know based on 
network coding takes 0(nA;/logn) rounds |28j . In contrast, we show that in the offline setting 
there exist centralized token-forwarding algorithms that run in ©(n^'^yTogn) time. An interesting 
open problem is to obtain tight bounds on offline token-forwarding algorithms. 
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